
The iPSC®/860 is an ultra-high- 
performance computer system 
designed to operate in business, 
manufacturing, and research envi- 
ronments. The system’s distrib- 
uted-memory multicomputer 
architecture, and use of standard 
microcomputer components pro- 
duces a scalable system that offers 
both high performance and high 
economy. Peak-rated at 7.6 biUion 
double-precision operations per 
second and configurable with up to 
8 GBytes of system memory, the 
iPSC/860 exceeds the floating point 
performance of any conventional 
vector supercomputer. Priced at less 
than $450 per MFLOPS, the iPSC/ 
860 is also ten times more cost- 
effective than traditional 
supercomputers. Life-cycle costs 
are low as well, since iPSC/860 sys- 
tems are air-cooled, compact and 
reliable. 

The iPSC/860 system comes with 
a comprehensive software suite for 
developing parallel applications 
and adapting existing programs to 
take advantage of Intel’s parallel 
supercomputer architecture. The 
iPSC/860’s UNIX workstation- 
based cross-development environ- 
ment includes performance- 
optimizing compilers for industry- 
standard implementations of the 
FORTRAN and C languages. Sup- 
plementing the compilers are a par- 
allel-oriented CASE development 
environment, debugger, perform- 
ance analyzer, and an iPSC/860 
simulator. High-performance 
mathematical libraries and matrix 
solvers are also available. 

For easy integration into net- 
worked computing environments, 
the iPSC/860 can connect to other 
computer systems through one or 
more high-performance Ethernet 
channels. Support for network 
standards, such as the X-Window 
System, the Network File System 
(NFS), and the FTP and TCP/IP pro- 
tocols, foster integration into 
VAX/VMS, UNIX, and IBM environ- 
ments. 


LEVERAGING STANDARDS FOR 
LOW COST SUPERCOMPUTER 
PERFORMANCE 



ture account for its exceptional per- 
formance and cost-effectiveness. 
The iPSC/860 is based on standard 
commercial microprocessors, mem- 
ories, and disks, and standards- 
based .software and I/O interfaces. 
Intel ’s ten-year investment in para- 
llel systems technologies trans- 
forms these capabilities into an 
integrated multi-function super- 
computer. 

The iPSC/860’s use of commodity 
CMOS memory components and 5 
1/4” disks leverages the enormous 
mass market R&D investment in PC 
and workstation technology. This 
ensures maximum cost-effective- 
ness and also puts future product 
developments on a predictable 
course for further improvements in 
capacity and speed. As the inventor 
and leading manufacturer of high- 
performance microprocessors, Intel 
is now actively exploiting this tech- 
nology for technical computing 
applications. 


The iPSC/860 exceeds the 
floating point performance of 
conventional supercompulers 
at one-tenth their cost. 


The underlying technologies that 
implement the iPSC/860 architec- 





Along with its use of standard 
commercial microprocessor compo- 
nents, the iPSC/860 implemente a 
wide range of industry standards, 
thus increasing interoperability and 
further contributing to lower sys- 
tem costs. iPSC/860 standards 
range from the VME and SCSI I/O 
interface standards through soft- 
ware standards such as the UNIX 
operating system, TCP/IP network- 
ing services, and NFS file access 
services. 


iPSC/860 HARDWARE OVERVIEW 

The iPSC/860 is based on an 
advanced, large-scale parallel 
multicomputer architecture using 
independent computing elements 
that collectively process informa- 
tion in parallel. The computing ele- 
ments, called nodes, are each a 
fully functional computer system 
containing a CPU, local memory, 
and I/O facilities. 

Communication among nodes is a 
critical performance element for 
parallel processing computers. 
iPSC/860 nodes communicate via 
Intel’s proprietary, high-speed 
Direct-Connect™ internal network, 
passing messages that contain 
instructions or data among the 
nodes. This internal network is 
inherently scalable and allows com- 
munications capacity to grow pro- 
portionately with the number of 
processors. 

This simple but powerful archi- 
tecture, with its distributed pro- 
cessors, on-board memory and 
message-passing communications. 


Figure 1: The iPSC/860 ’s distributed 
memory, message-passing architecture 
provides exceptional parallel efficiency 
and performance. 

produces several significant bene- 
fits: 

• performance: because the inter- 
nal network can be expanded to 
accommodate a large number of 
processors; 

• economy: because individual 
nodes can be constructed from 
low-cost CMOS components; and 

• flexibility: because of the the sys- 
tem’s inherent modularity, scala- 
bility, and ability to support varied 
programming models. 

The iPSC/860 provides excep- 
tional parallel efficiency up to the 
maximum number of processors 
since resource bottlenecks are min- 
imized. By contrast, parallel com- 
puters that use shared memory 
architectures must contend with a 
single scarce resource: memory 
bandwidth. This contention limits 
the effective use of parallel process- 
ing to eight or fewer high-perform- 
ance CPUs. Facing this Umitation, 
manufacturers must resort to 
extremely expensive components 
based on ECU or galUum arsenide 
semiconductor technologies to 
achieve high system performance. 
These tradeoffs severely limit the 
cost-effectiveness of such systems. 

Intel’s multicomputer architec- 
ture produces a system that is inher- 
ently upgradable. Customers can 
make a modest investment in super- 
computing, then scale up their sys- 


tem as their needs grow. iPSC/860 
systems are scalable from 8 pro- 
cessors to 128 processors, with peak 
performance ranging from 480 
MFLOPS for an 8-processor system 
to 7.6 GFLOPS for a system with 
128 processors. Memory is scalable 
as well. Each processing node can 
have 8 to 64 MBytes, providing total 
system memory ranging from 64 
MBytes to 8 GBytes. 

The iPSC/860 ’s distributed mem- 
ory, message-passing architecture 
also provides the flexibility to mix 
node types in a single system and to 
dedicate nodes to specific tasks 
such as processing or I/O. 

Finally, Intel’s supercomputer 
architecture supports a Multiple 
Instruction Multiple Data (MIMD) 
computational model that stands in 
contrast to the Single Instruction 
Multiple Data (SIMD) model used 
by several of today ’s massively para- 
llel supercomputers. The SIMD 
model severely restricts program- 
mers by requiring that processors 
execute identical programs in lock- 
step fashion . The iPSC/860 ’s MIMD 
model, on the other hand, gives 
programmers the flexibility to 
tackle problems with complex 
boundary conditions, for example, 
or problems that have many subpro- 
grams of differing computational 
complexity. 


NUMERIC PROCESSING NODES 

The iPSC/860 performs its 
numeric computations on special- 
ized processing nodes. Each 
numeric node has is own CPU, ker- 
nel operating system, local mem- 
ory, and interface to the 
Direct-Connect internal network. 

The CPU is based on a 40 MHz 
Intel i860 microprocessor, the 
world’s first supercomputer-on-a- 
chip. Each i860 processor contains 
a 64-bit RISC processor, a floating- 
point adder, a floating-point multi- 
plier, and 8 and 16 Kbyte caches for 
instructions and data. Peak-rated 
at 60 MFLOPS double precision and 
80 MFLOPS single precision, a sin- 
gle i860 microprocessor actually 
exceeds the scalar processing per- 
formance of a Cray 1 supercom- 
puter. Intel’s single-chip, 
million-transistor implementation 
of the i860 ensures computational 



860 nodes and to the System 
Resource Manager disk files. Con- 
current I/O disk files can be 
accessed directly via the File Thans- 
fer Protocol (FTP). 

For interactive processing, the 
iPSC/860 offers the graphical inter- 
face of the industry-standard X- 
Window System, UNIX sockets, 
and the iPSC/860’s Remote Host 
facilities. X Window System client 
services allow iPSC/860 nodes to 
communicate with distributed 
workstations over the network. 
The UNIX Sockets interface can be 
used to create custom process-to- 
process communications. Remote 
Host is a network communications 
facility that extends the iPSC/860’s 
System Resource Manager user and 
application interface onto Sun 
workstations. Sun-to-iPSC/860 
communications are made trans- 
parent to the user, allowing users 
and developers to avoid the distrac- 
tion of remote logins. Remote Host 
allows applications running on the 
workstation to communicate with 
the iPSC/860 via Intel’s message 
passing, as if the workstation were 
another node in the system. 

The iPSC/860’s capabilities as a 
network server include batch-style 
processing using the Network 
Queuing System (NQS), an indus- 


try-standard batch manager for 
UNIX chent/server environments. 
With NQS, iPSC/860 users can sub- 
mit jobs in the form of UNIX shell 
scripts or command files. Multiple 
job queues and multiple priorities 
enable a mix of small tasks to run in 
small partitions of the system while 
large tasks are dedicated to a parti- 
tion of many numeric nodes. NQS 
also lets users direct input and out- 
put to remote printers, file servers 
and workstations on the network. 


LANGUAGES AND SOFTWARE 
DEVELOPMENT TOOLS 

For software developers, Intel 
provides a rich environment of 
cross-development tools that run on 
UNIX workstations via network 
connection to the iPSC/860. 
FORGE® and CAST™, state-of-the- 
art CASE tools for parallel program- 
ming, are available as aids to 
porting and managing parallel 
FORTRAN applications. FORGE 
offers powerful facilities for analyz- 
ing the structure and data flow of 
FORTRAN programs, while CAST 
provides interactive structure edi- 
tors and FORTRAN code generators 
that transform applications into 
parallel programs ready for compi- 
lation and execution. Graphical 
performance analysis tools are 


available for performance tuning of 
parallel applications. 

For outstanding run-time per- 
formance, iPSC/860 C and iPSC/ 
860 FORTRAN offer sophisticated 
optimization techniques that 
ensure effective pipeline uti liz ation 
and efficient cache management. 
To simplify porting and program- 
mer training, the compilers con- 
form to industry standards. 
iPSC/860 C conforms to the pro- 
posed ANSI C standard (X3JII), 
while iPSC/860 FORTRAN is a 
superset of FORTRAN?? that 
includes VAX/VMS 4.0 and IBM/VS 
FORTRAN extensions. The compil- 
ers also have language-compatible 
libraries for iPSC/860 message- 
passing and file system services, as 
well as both IEEE-compliant float- 
ing point run-time libraries . Ada for 
the iPSC/860 is under development. 


LIBRARIES AND SOLVERS 

The iPSC/860 ProSolver family of 
matrix solvers has been specifically 
designed and tuned for the iPSC/ 
860. The family includes the Pro- 
Solver-DES direct equation solver, 
the ProSoIver-IES iterative equa- 
tion solver, and the ProSoIver-SES 
skyline equation solver. ProSolver 
matrix solvers can be used as the 
starting point for new applications 


iPSC 860 System Specifications 


Number of CPUs 8 

Peak Performance (GFLOPS) 

Double Precision .48 

Single Precision .64 

Memory Capacity (MBytes) 

Standard 64 

Maximum 12 

Storage (GBytes) 

Standard .65 

Maximum 9.1 

Internal Network (MBytes/sec.) 22.4 

(Bisection Bandwidth) 

External I/O 

Available Channels ? 

Maximum UO (MBytes/sec. ) 19.6 

Physical Size (feet) 

Width 1.8 

Height 5.0, Depth 2.1 

Operating Temperature (°C) 10-30 
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density and exceptional perform- 
ance. 

Complementing the i860 micro- 
processor, numeric nodes also con- 
tain from 8 to 64 MBytes of 
on-board memory. This combina- 
tion of high-performance CPU with 
large local memory means each 
node can support a wide selection of 
programming models, algorithms, 
and tools. This approach also sim- 
plifies the program development 
process, allowing programmers to 
run and test programs on a single 
node before parallellizing the pro- 
gram. Other parallel supercompu- 
ter systems lack this basic facility, 
forcing developers to effect a com- 
plete and flawless parallel imple- 
mentation in a single step . 


INPUT/OUTPUT NODES 

In keeping with its architectural 
philosophy, the iPSC/860 uses spe- 
cialized I/O nodes to provide a scal- 
able I/O facility that can be 
configured to meet specific applica- 
tion requirements. Based on the 
Intel 386 microprocessor, I/O nodes 
can control peripherals, external 
interfaces and local area networks. 

For example, each I/O node can 
have an industry-standard 4 MBy- 
tes/sec SCSI interface. Using 760- 
MByte 5.25-in disks, disk arrays 
can be configured with as much as 
165 GBytes of storage. Intel’s Con- 
current File System (CFS) software 
combines high-performance block 
interleaving with standard UNIX 
file interfaces to simplify data man- 
agement for disk arrays and make 
parallel data access to large num- 
bers of disks transparent to users. 

The iPSC/860’s I/O nodes also 
allow use of other mass storage 
options such as 8mm cartridge tape 
for file backup and nine-track reel- 
to-reel tape drives for data inter- 
change. In addition to the SCSI 
interface, I/O nodes can provide 
VME support for various data acqui- 
sition boards and custom inter- 
faces. 

For networking to other comput- 
ing resources, the iPSC/860 offers 
multiple TCP/IP Ethernet chan- 
nels. Dedicated network nodes per- 
form protocol processing and data 
buffering; the network nodes also 
support a UNIX sockets software 
interface. 


DIRECT-CONNECT 
INTERNAL NETWORK 

The iPSC/860’s Direct-Connect 
internal network provides high- 
speed data pathways between all 
nodes of an iPSC/860 system and 
supports literally hundreds of 
simultaneous processor-to-pro- 
cessor conununications with uni- 
form performance between all 
nodes. Each channel is a bi-direc- 
tional pathway that delivers data at 
5.6 MBytes/second. Automatic 
switching hardware frees the pro- 
grammer from the details of mes- 
sage routing, making Direct- 
Connect easy to use. 

The number of available chan- 
nels increases with system size, 
maintaining a balance between 
conununications performance and 
processing performance even as the 
system grows in size. This expand- 
ability is a key distinction between 
iPSC/860 systems and traditional, 
bus-based parallel architectures. 
Because of Intel’s Direct-Connect 
technology, iPSC/860 systems can 
scale to 128 compute nodes and 127 
I/O nodes, while conventional 
multiprocessor systems are limited 
to 8 processors. 

Along with providing an internal 
pathway between processors, the 
Direct-Connect data highway also 
provides channels from individual 
processors to the iPSC/860 ’s System 
Resource Manager (SRM). 


SYSTEM RESOURCE MANAGER 

The System Resource Manager 
oversees system allocation and sys- 
tem diagnostics services. The SRM 
is responsible for allocating and 
deallocating portions of the iPSC/ 
860 system to various users. As the 
iPSC/860’s diagnostic console, the 
SRM also contains a comprehensive 
set of system test routines that assist 
in isolating faults. 

In addition to these primary func- 
tions, the SRM also supports tasks 
such as file storage, network gate- 
way services, and a UNIX System V 
software development environ- 
ment. For maximum system 
throughput, these functions are 
ordinarily performed by I/O nodes, 
while the software development 
tools run on Sun workstations con- 
nected to a shared Ethernet net- 
work. 


PACKAGING AND 
ENVIRONMENTAL ISSUES 

The iPSC/860 uses a modular, air- 
cooled design that keeps expansion 
simple and facility costs low. Even 
the largest standard system config- 
uration, with 128 numeric nodes 
and a complementary I/O facility, is 
maintained in a normal computer- 
room environment and ogcupies 
less than 12 square feet of floor 
space. iPSC/860 systems run on 
standard AC power at 208 volts and 
less than 30 Amps per cabinet. And 
since iPSC/860 systems are air 
cooled, there’s no need for the spe- 
cialized plumbing and cooling that 
can drive up the operating costs of 
traditional supercomputers. 


iPSC/860 SOFTWARE OVERVIEW 

Intel offers a comprehensive soft- 
ware system optimized for the iPSC/ 
860 ’s multicomputer architecture 
and designed for efficient usage in a 
distributed computing environ- 
ment. 

For software development, the 
iPSC/860’s workstation-based 
cross-development tools allow pro- 
grammers to remain in their famil- 
iar program development 
environment and keep the super- 
computer’s processing power dedi- 
cated to user applications. Intel 
offers mathematical libraries and 
parallel solvers that reduce pro- 
gramming effort and parallel CASE 
tools and a parallel debugger that 
aids in obtaining high run-time per- 
formance. 

System management functions 
allow users to share the computer. 
Topically, users prepare a program 
or data set at a local workstation, 
then obtain some or all of the super- 
computer ’s processing nodes for 
testing or production runs. For 
example, one user might be granted 
half of a system’s available pro- 
cessorstorunaparticularjob, while 
four others run concurrently, each 
on one eighth of the system’s proc- 
essing nodes. Each user’s program 
operates independently and con- 
currently. Users can use the system 
interactively or take advantage of 
the system’s UNIX batch facility for 
deferred execution. Programs and 
data can reside on the iPSC/860’s 
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System Resource Manager, on its 
Concurrent File System (CFS), on 
the user’s workstation, or at any 
NFS file server on the network. 


iPSC/860 OPERATING SYSTEM 

iPSC/860 system software 
emphasizes performance, UNIX 
standard application interfaces, 
and ease of use. Based on a light- 
weight kernel operating system, 
called NX/2, the iPSC/860 main- 
tains a consistent run-time envi- 
ronment while offering fast and 
flexible message passing services. 
NX/2 resides on each of the com- 
pute nodes and I/O nodes in the sys- 
tem and provides synchronous, 
asynchronous and interrupt-driven 
communication services with auto- 
matic flow control and buffering as 
well as support for global opera- 
tions. By maintaining compatibility 
with UNIX I/O services, NX/2 also 
simplifies the effort required to port 
new applications to the iPSC/860. 


FILE SERVICES 

The iPSC/860’s Concurrent File 
System (CFS) provides a UNIX- 


compatible file system while hiding 
the concurrency of multiple I/O 
nodes and disks. Limited in size 
only by the aggregate space of all 
the disks in the system. The CFS 
can support file sizes of many 
GBytes without partitioning. 

The Concurrent File System exe- 
cutes reads and writes to multiple 
disks in parallel, for greater per- 
formance. The CFS manages paral- 
lel I/O through a method that is 
similar to disk striping but more 
efficient because it does not require 
users to synchronize reads or 
writes. The file system automati- 
cally distributes the blocks of indi- 
vidual files across all available 
disks, using algorithms for reading 
and writing that allow several inde- 
pendent applications to run simul- 
taneously. Programmers thus gain 
the performance of parallel disk 
access while keeping the simplicity 
of a single, non-partitioned file. 

Filing services also support nine- 
track and 8mm tape drives. 

NETWORK SERVICES 

The iPSC/860 provides a rich set 
of features for using the supercom- 


Figure 2: The iPSC/860 ’s system compo- 
nents create an efficient, easy-to-use 
multifunction supercomputer designed 
for performance, economy, and flexi- 
bility. 

puter in a networked environment. 
The system uses Ethernet and TCP/ 
IP software as the base for an exten- 
sive set of standard networking 
capabilities. Dedicated I/O nodes 
with hardware-assisted TCP/IP pro- 
tocols provide a high-performance, 
industry-standard connection to a 
wide variety of systems. Through 
the use of gateways, TCP/IP can be 
extended to connect to non-UNIX 
systems such as DEC VAX/VMS and 
IBM MVS computers. Multiple I/O 
nodes can expand network connec- 
tions and throughput while keeping 
networking tasks from interfering 
with applications running on the 
iPSC/860. 

In addition to TCP/IP network- 
ing, afull complementof standards- 
based applications-level software is 
available for file access, interactive 
processing, and batch processing 
on the network. The Network File 
System (NFS) is provided to iPSC/ 


development, or they can be incor- 
porated into existing codes as part 
of the parallel conversion process. 
The result is quick time to market 
and maximum applications per- 
formance. 

Intel also offers the widely used 
BLAS and NAG math libraries. 


INTEL SUPPORT SERVICES 

To help customers fully exploit 
the power of their iPSC/860 super- 
computer, Intel provides extensive 
technical support and consultation, 
including end-user training, appli- 
cations consulting, benchmarking, 
systems integration support, and 
porting services. Intel also oflfers a 
comprehensive system mainte- 
nance program supported with 
trained technicians in each of 
Intel’s 120 offices worldwide. 

Intel, iPSC, i860, Concurrent File System, Direct-Connect and NX/ 
2 are registered trademarics of Intel Coiporation. All other brand 
and product names are trademarks of their reflective owners. 
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